I Deleted Everything And Asked An AI To Rebuild It
I deleted everything. The codebase was too big. Files were scattered. Agents were confused. The mess had won. So I pressed delete. Then I gave an AI a prompt. Now it is building the full app. I do not trust it enough for production. I will check the code. That is the plan. That is the hope.
Sometimes the best way to fix a tangled codebase is to burn it down and ask someone else to rebuild it. The someone else is an AI. The rebuilding is in progress. The anxiety is real.
The Reason
The old codebase had grown beyond control. Features were layered on features. Configuration files multiplied. Agents could not navigate the structure. Training scripts conflicted. Benchmarking logic duplicated. The complexity obscured the signal. I could not see what worked. I could not see what failed. I could only see the mess.
Deleting everything felt extreme. It also felt necessary. A clean slate removes ambiguity. A clean slate forces clarity. A clean slate lets an AI start from first principles. That is the gamble. That is the experiment.
The Full Prompt
Here is the exact prompt I gave to the AI. It is long. It is specific. It is ambitious. I am sharing it because transparency matters. Because others might learn from it. Because I want to be held accountable to what I asked for.
What The Prompt Asks For
The prompt defines a complete training pipeline for sub one million parameter models. It specifies data sources, precision settings, architecture choices, benchmarking requirements, and a five-phase development workflow. It sets measurable targets: 62.7 percent BLiMP accuracy, 30.0 percent ARC accuracy, 2.166 bits per byte.
It requires the AI to research before implementing. To experiment before finalizing. To verify before declaring success. It demands that every kept feature be added to the fullest extent. It insists on hyper efficient training and blazing fast inference. It forbids skipping steps because the AI does not feel like it.
The prompt is a contract. It is also a challenge. It is also a test of whether an AI can build what I have struggled to build myself. That is the gamble. That is the experiment.
Why This Might Work
The prompt is specific. It sets clear constraints. It defines measurable targets. It forces the AI to research before implementing. It requires experimentation before finalizing. It demands verification before declaring success. This structure reduces hallucination. It increases accountability.
I will check the code. I will run the tests. I will validate the benchmarks. The AI builds. I verify. That division of labor leverages AI speed and human judgment. That balance might produce a working system. It might also produce a spectacular failure. Both outcomes teach me something.
Delegation is just trusting someone else to do the work while you watch nervously. I am delegating to an AI. I am watching nervously. The work is in progress.
What I Am Giving Up
I am giving up control over implementation details. I am giving up the comfort of writing every line myself. I am giving up the illusion that I understand every optimization. In return I gain speed. I gain fresh perspectives. I gain the chance to learn from an AI that does not share my blind spots.
The trade-off is intentional. The risk is acknowledged. The reward is uncertain. That uncertainty is the point. That uncertainty is the opportunity.
Final Thoughts
I deleted everything. I asked an AI to rebuild it. The prompt is long but simple. The goal is ambitious but measurable. The process is structured but flexible. I will check the code. I will validate the results. I will learn from the outcome.
If it works, I will have a hyper efficient training pipeline for sub one million parameter models. If it fails, I will have a detailed log of why. Both outcomes advance the project. Both outcomes justify the gamble.
Thank you for watching the reset. Thank you for accepting that sometimes starting over is the only way forward. Thank you for believing that tiny models can still surprise us. The rebuild is underway. The progress is weird. The hope is real.